Abstract

The capital distribution curve is defined as the log-log plot of normalized stock capitalizations ranked in
descending order. The curve displays remarkable stability over time.

The theory of exchangeable distributions on set partitions, developed for the purposes of mathematical genetics
and recently applied in non-parametric Bayesian statistics, provides a probabilistic-combinatorial approach to the
analysis and modeling of the capital distribution curve. The framework of the two-parameter Poisson-Dirichlet
distribution contains a rich set of methods and tools, including an infinite-dimensional diffusion process.

The purpose of this note is to introduce the framework of exchangeable distributions on partitions in the
financial context. In particular, it is shown that averaged samples from the Poisson-Dirichlet distribution
approximate the capital distribution curves in equity markets. This suggests that the two-parameter model
can be employed for modeling the evolution of market weights and prices fluctuating in stochastic equilibrium.
1 Introduction


The capital distribution curve is defined as the log-log plot of stock market weights ranked in descending order.
Temporal stability of the shape of this curve is one of the cornerstones of Stochastic Portfolio Theory (SPT),
developed by Fernholz, Karatzas et al. ([9], [11] and [10]). In contrast to the MPT and the CAPM, which are
based on normative assumptions, Stochastic Portfolio Theory is a descriptive theory, since it studies the empirical
dynamics and characteristics of equity markets. In particular, the SPT captures the tendency of stocks to retain
their ranks. The SPT model employs the machinery of rank-interacting Brownian particles and semimartingales.

The framework of partition structures, imported from mathematical genetics and comprising combinatorial and
probabilistic methods, provides a complementary approach to modeling and analysis of the capital distribution
curve and can be summarized as follows.

• The market is considered as a large combinatorial structure: a partition of the set of invested units of
money. Capitalizations of individual stocks correspond to block (cluster) sizes of the partition, represented
by integers, for instance, measured in cents.

• The number of set partitions defines the number of ways each partition can be realized combinatorially. In other
words, the market can be represented as a giant Young diagram, with the vector of capitalizations determining
the (potentially very large) number of ways such a market configuration can be realized.

Partition structures are important for several reasons.

• First, partition structures provide a model of random transitions with dynamic dimensions. In other
words, at any time the number of diffusion components may change due to the appearance of a new stock or
the bankruptcy of an existing firm.

• Second, a partition structure with a non-trivial limiting distribution defines the asymptotic shape of the
corresponding combinatorial structure. In particular, the mechanism of shape formation provides an explanation of
the phenomenon of stability of the capital distribution curve.

The two-parameter Poisson-Dirichlet model is a remarkable and well-studied instance of partition structures.
It possesses an analytically tractable limiting distribution defined on the simplex of ranked weights.

Poisson-Dirichlet distribution. The Dirichlet distribution with an m-dimensional vector of parameters
$(\alpha_1, \ldots, \alpha_m)$ defines a probability distribution for non-negative proportions in the standard simplex.
Kingman [18] considered the limiting behavior of this distribution with a symmetric vector of parameters
$(\alpha, \ldots, \alpha)$ such that $\theta = m\alpha = \mathrm{const}$ as $m \to \infty$, and called the distribution of the ranked
components the Poisson-Dirichlet ($\mathcal{PD}$) distribution (with one parameter $\theta$). This distribution is defined on
the infinite simplex of ranked weights, known as the Kingman simplex

$$\nabla = \{x_1 \ge x_2 \ge \ldots \mid x_i \ge 0, \ \sum_i x_i = 1\}$$

Size-biased permutation provides an efficient method of sampling from the Dirichlet and the Poisson-Dirichlet
distributions. In the framework of population biology, Engen [5] suggested a modification of the size-biased method,
which produced another class of Poisson-Dirichlet distributions. It was called the two-parameter Poisson-Dirichlet
distribution by Perman, Pitman and Yor, who rediscovered it in the context of studying ranked
jumps of gamma and stable subordinators (see [22], [26]). The monograph by Pitman [25] contains a wealth of
information on the two-parameter Poisson-Dirichlet model. As shown by Chatterjee and Pal [4], the limiting behaviour
of a rank-interacting system of Brownian particles is characterized by the $\mathcal{PD}(\alpha, \theta)$ distribution.

Aoki pioneered applications of exchangeable distributions in economics ([1], [2]), in particular using the finitary
characterization by Garibaldi, Costantini, et al. ([13], see also the book [16]). A Markov chain approach with
transitions in the space of partitions was independently developed by Garibaldi, Costantini, et al. [13], [14]. Petrov [23],
inspired by the works of Kerov, Fulman [12], Borodin and Olshanski [3], constructed a diffusion process preserving
the two-parameter Poisson-Dirichlet distribution in the infinite-dimensional ranked simplex.

This research note aims to illustrate applications of partition structures and the two-parameter model
to modeling the stochastic evolution of the capital distribution curve. In particular, it is shown in Section 5 that
the two-parameter model provides a reasonable approximation of capital distribution curves in equity markets.
Moreover, the model also fits the distribution of relative total capitalizations of stock exchanges.

Main results of this paper were presented at the 8th World Congress of the Bachelier Finance Society, 2014. 
The author is very grateful to Prof. I. Karatzas for useful advice and suggestions. 
1.1 Capital distribution curve

The log-log plot of ranked market weights displays

• power law behavior, 

• concavity of the curve and 

• stability over periods of time 

For example, the figure below shows capital distribution curves of the NASDAQ market on three dates in 2014.¹
As can be seen from the chart, most market weights had relatively small fluctuations, despite significant
fluctuations of the NASDAQ market capitalization during that period. Stability of the capital distribution
curve suggests a certain independence of market weights and overall market capitalization.



Figure 1: NASDAQ, capital distribution curves on May 27, Sep 24, Dec 9, 2014

A more detailed chart reveals the behavior of the weights of the top 100 stocks.



Figure 2: weights of top 100 stocks, NASDAQ 

Capital distribution curves of the majority of equity markets, as well as the distribution of capitalizations of world
stock exchanges, have shapes similar to the one shown in Figure 1. Section 5 contains examples of fits of these curves
by the $\mathcal{PD}$-model.


¹ Data source: http://www.google.com/finance#stockscreener
1.2 Poisson-Dirichlet distribution and market weights

The log-log plot of ranked samples from the Poisson-Dirichlet law is characterized by

• power law behavior, 

• concavity of the curve and 

• stability around average shape 

The infinite-dimensional Poisson-Dirichlet distribution generalizes the symmetric finite-dimensional Dirichlet
distribution. Moreover, as shown in Section 1.4, both distributions can be represented by normalization of sequences
of random variables $(y_1, y_2, \ldots)$ by their sum $S = \sum_j y_j$

$$(y_1/S, \ y_2/S, \ \ldots)$$

with the property of independence of the weights and the sum $S$.

The figure below illustrates the fit of NASDAQ market weights by averages of samples from the two-parameter
distribution. The parameters are estimated by the least squares method.

Figure 3: NASDAQ fit by $\mathcal{PD}(0.60, 55)$ (data as of Dec 9, 2014); horizontal axis: rank

The next figure displays the typical behaviour of ranked random weights.

Figure 4: 20 sample paths of $\mathcal{PD}(0.60, 55)$; horizontal axis: rank
1.3 Ranked capitalizations and market weights

Stock capitalization at time $t$ is calculated as the product of the number of shares outstanding and the stock price

$$C_n(t) = q_n(t) \cdot P_n(t)$$

For capitalizations ordered as $C_{(1)}(t) \ge C_{(2)}(t) \ge \ldots$, the corresponding ranked market weights are
determined by

$$x_{(n)}(t) = \frac{C_{(n)}(t)}{M(t)}$$

where $M(t) = \sum_n C_n(t)$ is the total market capitalization at time $t$. Stability of the capital distribution curve means

$$x_{(j)}(t) \approx \mathbb{E}\, x_{(j)}(t + \Delta t)$$

In other words, ranked weights remain approximately the same despite changes in capitalizations. This implies
that for relatively short periods of time, while a stock retains its rank, the numeraire approach to pricing
approximately holds

$$\frac{P_{(n)}(t)}{M(t)} \approx \mathbb{E}\, \frac{P_{(n)}(t + \Delta t)}{M(t + \Delta t)}$$

However, it should be noted that the longer the time period $\Delta t$, the less likely it is that a stock retains its rank.
A more advanced approach to modelling market weights and stock capitalizations is based on the application of diffusion
theory and the representation of the $\mathcal{PD}(\alpha, \theta)$ distribution in terms of jumps of subordinators. This
representation is known as Proposition 21 in the celebrated paper of Pitman and Yor [26].


1.4 Gamma-Dirichlet algebra 

There is a close relationship between the gamma and Dirichlet distributions, characterized by a number of important
properties, which in the symmetric case can be summarized as follows. Consider $m$ independent and
identically distributed gamma variables $y_i \sim \mathcal{G}(\alpha, c)$ with shape $\alpha$ and scale $c$. The first, the
convolution property, states that the sum of these variables $S = y_1 + \ldots + y_m$ has the gamma distribution
$S \sim \mathcal{G}(\theta, c)$ with $\theta = m\alpha$. The second property states that the normalized components
$x_i = y_i/S$ are independent of the sum $S$; moreover, as shown by Lukacs [20], this characterizing property holds
if and only if the $y_i$ are gamma distributed with the same scale $c$. Finally, the normalized vector
$x = (y_1/S, \ldots, y_m/S)$ has the symmetric Dirichlet distribution $x \sim \mathcal{D}_m(\alpha)$.

Conversely, given a Dirichlet distributed vector $x \sim \mathcal{D}_m(\alpha)$ and an independent gamma distributed
$S \sim \mathcal{G}(\theta, c)$, the 'restored' variables $y_i = x_i \cdot S$ have gamma distributions $y_i \sim \mathcal{G}(\alpha, c)$.

These properties also hold in the case of the ordered Dirichlet distribution. For instance,
with ranked components $x_{(1)} \ge \ldots \ge x_{(m)}$ obtained from the symmetric Dirichlet distribution and an independent
$S \sim \mathcal{G}(\theta, c)$, the restored gamma variables $y_{(i)} = x_{(i)} \cdot S$ are also ranked in descending order.

A similar characterization of the $\mathcal{PD}(\alpha, \theta)$ law is provided by Proposition 21 in Pitman and Yor [26],
which informally can be restated as follows. Consider a tempered stable subordinator $\tau_t$ with Lévy density
$h(y) = \ldots$ on a random time interval $[0, T]$, with $T \sim \mathcal{G}(\theta/\alpha, 1)$, and denote the ranked jumps of the
subordinator in this interval by $\eta_{(1)} \ge \eta_{(2)} \ge \ldots$. The sum of these jumps is equal to the value of the tempered
subordinator stopped at the random time $T$

$$S = \sum_n \eta_{(n)} = \tau_T$$

As in the case of the Dirichlet distribution, Proposition 21 in [26] states that the sum of the jumps $S \sim \mathcal{G}(\theta, 1)$.
The second statement of the proposition is that the normalized jumps $\xi_{(i)} = \eta_{(i)}/S$ are independent of the sum $S$.
Finally, the sequence of normalized jumps $\xi_{(1)} \ge \xi_{(2)} \ge \ldots$ has the Poisson-Dirichlet distribution with
parameters $(\alpha, \theta)$. In what follows, Prop. 21 provides a convenient way of modeling the stochastic evolution of
stock prices 'restored' from the dynamics of market weights.


1.5 PD-market model

It is natural to employ the stick-breaking and size-biased sampling methods described in Sections 3 and 4 for
modeling a diffusion with stationary Poisson-Dirichlet distribution. This approach was first proposed by Feng
and Wang [8], who also proved reversibility of the corresponding infinite-dimensional process. Recall that the
Wright-Fisher diffusion process $Z = Z(t)$ driven by the SDE

$$dZ = \tfrac{1}{2}\left[\alpha_1 (1 - Z) - \alpha_2 Z\right] dt + \sqrt{Z(1 - Z)}\, dB$$

has the reversible stationary beta distribution $Z^* \sim \mathcal{B}(\alpha_1, \alpha_2)$.
If $X_n(0)$ denotes the market weight of the $n$-th largest stock at time $t = 0$, then the stochastic evolution of market
weights can be determined from the stick-breaking process

$$X_1(t) = Z_1(t), \qquad X_n(t) = Z_n(t)\Big(1 - \sum_{i=1}^{n-1} X_i(t)\Big),$$

where the processes $Z_n = Z_n(t)$ are determined by independent SDEs

$$dZ_n = \tfrac{1}{2}\left[(1 - \alpha)(1 - Z_n) - (\theta + \alpha n) Z_n\right] dt + \sqrt{Z_n (1 - Z_n)}\, dB_n$$

with stationary beta distributions, corresponding to the size-biased sampling definition (7)

$$Z_n^* \sim \mathcal{B}(1 - \alpha, \theta + n\alpha)$$

Initial values of the processes $Z_n(0)$ are determined by

$$Z_1(0) = X_1(0), \qquad Z_n(0) = X_n(0) \Big/ \Big(1 - \sum_{i=1}^{n-1} X_i(0)\Big)$$

The local evolution of the overall market capitalization $M = M(t)$ can be modelled by the diffusion

$$dM = \tfrac{1}{2}\left[\theta - cM\right] dt + \sqrt{M}\, dB$$

with stationary gamma distribution $M^* \sim \mathcal{G}(\theta, c)$, where the constant $c$ is defined by the condition
$M(0) = \mathbb{E} M^*$. Correspondingly, the local behaviour of stock prices is defined by the product of independent processes

$$P_n(t) = \frac{1}{q_n} M(t) \cdot X_n(t),$$

where $q_n$ denotes the number of shares outstanding.
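
The dynamics above can be explored numerically. The following is only a minimal sketch, not the simulation method used for the figures: it assumes a plain Euler-Maruyama discretization with clipping to keep each $Z_n$ inside (0, 1), and it takes $c = \theta / M(0)$ on the assumption that the stationary mean of the capitalization SDE is $\theta / c$ (the point where its drift vanishes). The function name `simulate_pd_market`, the step size and the clipping constant are illustrative choices.

```python
import math
import random


def simulate_pd_market(x0, alpha, theta, m0, steps=1000, dt=1e-4, seed=0):
    """Euler-Maruyama sketch of stick-breaking weights and total capitalization.

    x0 holds the initial weights of the top stocks (their sum must be below 1).
    Returns a list of (M, weights) pairs, one per time step.
    """
    rng = random.Random(seed)
    eps = 1e-9  # clipping level, an assumption of this sketch

    # Initial fractions Z_n(0) = X_n(0) / (1 - sum_{i<n} X_i(0)).
    z, used = [], 0.0
    for x in x0:
        z.append(x / (1.0 - used))
        used += x

    c = theta / m0  # assumed: stationary mean theta / c equals M(0)
    m = m0
    path = []
    for _ in range(steps):
        for k in range(len(z)):
            n = k + 1  # rank index used in the drift (theta + alpha * n)
            drift = 0.5 * ((1 - alpha) * (1 - z[k]) - (theta + alpha * n) * z[k])
            noise = math.sqrt(z[k] * (1 - z[k]) * dt) * rng.gauss(0.0, 1.0)
            z[k] = min(max(z[k] + drift * dt + noise, eps), 1 - eps)

        dm = 0.5 * (theta - c * m) * dt + math.sqrt(m * dt) * rng.gauss(0.0, 1.0)
        m = max(m + dm, eps)

        # Rebuild weights: X_n = Z_n * prod_{j<n} (1 - Z_j).
        weights, rest = [], 1.0
        for zk in z:
            weights.append(zk * rest)
            rest *= 1.0 - zk
        path.append((m, weights))
    return path
```

Stock prices then follow as `m * w / q` for a hypothetical number of shares `q`; a smaller `dt` reduces the discretization bias near the boundaries.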



Figure 5: Simulation of weights, overall market value (market capitalization) and stock capitalizations
with stationary $\mathcal{PD}(0.60, 55)$ distribution
1.6 The broken-stick model


The broken-stick model is a simple model illustrating how a uniform partition produces patterns of inequality. MacArthur
[21] proposed this model to explain relative species abundances in a closed environment.

Assume that a stick of unit length represents some finite resource, such as territory, available food, a water
reservoir, etc., which must be shared between species. The resource is broken at random by throwing $n - 1$ cutting
points uniformly on this stick and breaking it into $n$ pieces. The length of each piece represents the share
taken by some class of species. While the average length of each piece is $1/n$, the ranked lengths of the pieces
display interesting behavior.

For instance, if the stick is broken into just two pieces, then the length of the smaller piece is never larger than 50%,
and since the cut point is uniformly distributed it is easy to see that the smaller piece on average represents 25% of
the length, while the larger one takes 75%. In general, it can be shown that after breaking the stick into $n$ pieces the
expected length of the $k$-th largest piece is given by

$$x_k = \frac{1}{n} \sum_{j=k}^{n} \frac{1}{j}$$


In the case of 3 pieces the expected proportions ranked in descending order are 61.1%, 27.8% and 11.1%. It can be
checked by straightforward simulation that dropping 4 points uniformly on the unit interval produces on average
the following ranked lengths of the 5 subintervals

(46%, 26%, 16%, 9%, 4%)

Sampled proportions will fluctuate around these expected lengths. For larger values of $n$ the ranked
expected proportions start to decay rapidly and it is more convenient to display them on a log-log plot.
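
The simulation mentioned above can be sketched as follows. This is an illustrative check rather than the author's code: the function names, the number of trials and the seed are assumptions, and the closed form implemented is $x_k = \frac{1}{n}\sum_{j=k}^{n} 1/j$.

```python
import random


def expected_ranked_lengths(n):
    """Closed form: x_k = (1/n) * sum_{j=k}^{n} 1/j for k = 1..n."""
    return [sum(1.0 / j for j in range(k, n + 1)) / n for k in range(1, n + 1)]


def simulated_ranked_lengths(n, trials=20000, seed=1):
    """Average ranked piece lengths after n - 1 uniform cuts of the unit stick."""
    rng = random.Random(seed)
    totals = [0.0] * n
    for _ in range(trials):
        cuts = sorted(rng.random() for _ in range(n - 1))
        points = [0.0] + cuts + [1.0]
        pieces = sorted((b - a for a, b in zip(points, points[1:])), reverse=True)
        for i, piece in enumerate(pieces):
            totals[i] += piece
    return [t / trials for t in totals]


if __name__ == "__main__":
    print([round(100 * x, 1) for x in expected_ranked_lengths(5)])
    print([round(100 * x, 1) for x in simulated_ranked_lengths(5)])
```

Comparing the two printed lists for several values of n shows how closely the Monte Carlo averages track the harmonic-sum formula.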



Figure 6: Expected proportions for n = 10,25,50 

This example illustrates that asymmetry in ranked proportions appears even with a completely uniform distribution
of the resource.
1.7 Toy model

Imagine that there are only two stocks, with capitalizations 3 and 2, in a market with total capitalization 5.
Tickers or names do not play an important role and are used only to distinguish the stocks. The ten ways in which
5 units of money can form a state with these capitalizations are represented by ten Young tableaux, written here
as set partitions of the units 1, ..., 5:

{1,2,3},{4,5}    {1,2,4},{3,5}    {1,3,4},{2,5}    {2,3,4},{1,5}    {1,2,5},{3,4}
{1,3,5},{2,4}    {1,4,5},{2,3}    {2,3,5},{1,4}    {2,4,5},{1,3}    {3,4,5},{1,2}

Since these partitions have the same block sizes, it is convenient to use a single Young diagram, here the shape
[3,2] (row lengths 3 and 2), to denote all partitions with the same shape. The 10 partitions above arise by adding
a new box:

• in one way to each of the 4 partitions with shape [3,1], and

• in two ways to each of the 3 partitions with shape [2,2].
For a Young diagram $c$, the combinatorial formula (1) provides $\mu(c)$, equal to

• the number of set partitions with given block sizes,

• which in financial terms is the same as the number of ways the market can form a state described by ranked
capitalizations.

An exchangeable probability distribution on partitions assigns the same probability to all partitions with the same
shape. This framework is useful when one is interested in studying the distribution of ranked block sizes
(capitalizations) regardless of block labels (tickers). If $\pi_n(c)$ denotes the probability of a partition with $n$
elements and shape $c$, then the total probability of all partitions with this shape is

$$P_n(c) = \mu(c) \cdot \pi_n(c)$$

The sum of these probabilities over all shapes (Young diagrams) with $n$ elements must be 1.

In the context of mathematical genetics, Kingman [19] considered a family of distributions $\{P_n\}$ on partitions
of $n = 1, 2, 3, \ldots$ elements and noticed that random sampling induces natural consistency constraints connecting
the distributions for levels $n - 1$ and $n$. He called distributions $\{P_n\}$ satisfying such constraints partition structures.

Continuing the example, consider the 10 partitions with the shape [3,2] (row lengths 3 and 2). In each of these
partitions any of the 5 boxes can be removed, so that the remaining partition has 4 elements. For each partition there are

• 2 ways to get to the shape [3,1], and

• 3 ways to obtain the shape [2,2],

so uniform deletion of a box leads from [3,2] to [3,1] with probability 2/5 and to [2,2] with probability 3/5.

Uniform deletion of a box on partitions of 5 elements induces a probability distribution on 4-partitions. For
example

$$P_4([2,2]) = \tfrac{3}{5} P_5([3,2]) + \tfrac{1}{5} P_5([2,2,1])$$

On the other hand, this consistency constraint defines forward conditional probabilities of partition growth, for
instance

$$P([2,2] \to [3,2]) = \tfrac{3}{5} P_5([3,2]) \,/\, P_4([2,2])$$

In other words, a partition structure is a family of distributions on partitions consistent under growth (or Up
moves) and recession (or Down moves). This enables considering market dynamics as a process of combinatorial
random walks on partitions driven by sequences of up or down transitions.





























































































2 Exchangeable partitions 

2.1 Descriptions of set partitions 

Galaxies, stars, companies and people form clusters, and the sizes of these clusters are rarely uniform. In finance and
economics, stocks and companies can be considered from the following point of view.

• A stock market comprises $k$ stocks with a total capitalization of $n$ units of money. If $n_i$ denotes the value of
the $i$-th largest company (by capitalization), then market weights are given by $x_i = n_i/n$ and the vector
$x = (x_1, x_2, \ldots)$ represents the capital distribution curve.

A new unit of money can join any of the stocks, thus increasing the capitalization of that stock to $n_i + 1$
and the total capitalization to $n + 1$, or a unit of money can leave a stock, decreasing the corresponding stock and
market capitalizations by 1. There is also a possibility that a new stock will be issued in an IPO,
which increases the number of clusters to $k + 1$.

• In the same way, companies' assets/values may increase or decrease. There is also a possibility
that a new company enters the market.

• In the mutual fund industry, money coming to the market joins existing funds proportionally to their size, but
there is always a chance that a new fund emerges.

The process of clustering can be represented by partitions of a set. For instance, the set of three letters 'a', 'b'
and 'c' can be partitioned as shown below in the left column, with the shapes of the corresponding Young diagrams
(row lengths) in the right column representing partition classes:

{a, b, c}            [3]

{a, b}, {c}          [2,1]
{a, c}, {b}
{b, c}, {a}

{a}, {b}, {c}        [1,1,1]

If in a set partition, represented by clusters/blocks, the cluster labels are not important and the order of items inside
each cluster is irrelevant, then such a partition is called exchangeable. Its shape is completely described by the
vector of its block sizes. Partitions with the same shape belong to the same
exchangeable class (or partition class). For the example with three companies, the set {a, b, c} has 5 partitions
and 3 exchangeable classes, represented by Young diagrams in the right column.

Every exchangeable partition of $n$ elements into $k$ clusters (blocks) can be described in two ways.

1. For the first description, since the labeling of clusters is not important, it is convenient to consider cluster
sizes arranged in descending order

$$n_1 \ge n_2 \ge \ldots \ge n_k,$$

where $n_i$ denotes the size of the $i$-th largest cluster, hence $n = n_1 + \cdots + n_k$.
In population biology terminology it is called a frequency vector:

$$\mathbf{n} = [n_1, n_2, \ldots, n_k]$$

Young diagrams correspond to this description.

2. For the second description, let $c_j$ denote the number of clusters with $j$ items, so that $c_1$ is the number of
clusters of size one, $c_2$ the number of clusters of size two, etc. Then the total number of items is

$$n = c_1 + 2 c_2 + \cdots + m c_m$$

and the number of clusters is given by $k = c_1 + \cdots + c_m$, where $m$ is the size of the largest cluster. To distinguish
the partition vector $c$ from the frequency vector, {}-notation is used

$$c = \{c_1, c_2, \ldots\}$$

Let $c \vdash (n, k)$ denote that the vector $c$ describes a partition of $n$ elements into $k$ clusters.
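
The two descriptions carry the same information, and converting between them is mechanical. A minimal sketch follows; the function names are illustrative, and the partition vector is padded with zeros to length n, as in the four-element examples of the next section.

```python
from collections import Counter


def frequency_to_partition_vector(freq):
    """[n_1, ..., n_k] -> {c_1, ..., c_n}, where c_j counts clusters of size j."""
    n = sum(freq)
    counts = Counter(freq)
    return [counts.get(j, 0) for j in range(1, n + 1)]


def partition_vector_to_frequency(c):
    """{c_1, c_2, ...} -> cluster sizes in descending order."""
    freq = []
    for j in range(len(c), 0, -1):
        freq.extend([j] * c[j - 1])
    return freq


if __name__ == "__main__":
    c = frequency_to_partition_vector([2, 1, 1])
    print(c, "n =", sum(j * cj for j, cj in enumerate(c, start=1)), "k =", sum(c))
```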
2.2 Size of exchangeable class

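Every exchangeable class contains the set partitions represented by the same partition vector $c = \{c_1, c_2, c_3, \ldots\}$.
The number of partitions in the class described by the vector $c$ is given by

$$\mu(c) = \frac{n!}{\prod_j c_j!\,(j!)^{c_j}} \qquad (1)$$

Indeed, by the multinomial formula, the number of ways to split $n$ elements into $c_1$ one-element subsets, $c_2$
two-element subsets, etc. is

$$\frac{n!}{\underbrace{1! \cdots 1!}_{c_1 \text{ times}}\ \underbrace{2! \cdots 2!}_{c_2 \text{ times}} \cdots}$$

which must be divided by $c_j!$ for each $j$, since permutations of blocks of the same size play no role.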
For instance, the 4-element set {a, b, c, d} has 15 partitions and 5 exchangeable classes:

{a,b,c,d}
    n = [4],  c = {0,0,0,1},  $\mu(c) = \frac{4!}{1!\,(4!)^1} = 1$

{a,b,c},{d}    {a,b,d},{c}    {a,c,d},{b}    {b,c,d},{a}
    n = [3,1],  c = {1,0,1,0},  $\mu(c) = \frac{4!}{1!\,(1!)^1\, 1!\,(3!)^1} = 4$

{a,b},{c,d}    {a,c},{b,d}    {a,d},{b,c}
    n = [2,2],  c = {0,2,0,0},  $\mu(c) = \frac{4!}{2!\,(2!)^2} = 3$

{a,b},{c},{d}    {a,c},{b},{d}    {a,d},{b},{c}    {b,c},{a},{d}    {b,d},{a},{c}    {c,d},{a},{b}
    n = [2,1,1],  c = {2,1,0,0},  $\mu(c) = \frac{4!}{2!\,(1!)^2\, 1!\,(2!)^1} = 6$

{a},{b},{c},{d}
    n = [1,1,1,1],  c = {4,0,0,0},  $\mu(c) = \frac{4!}{4!\,(1!)^4} = 1$


Interestingly, the partition n = [2,1,1] can be realized in 6 ways, whereas the uniform partition n = [2,2] can be realized in only 3 ways.
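
These counts can be cross-checked by brute force. The sketch below evaluates the class-size formula $\mu(c) = n! / \prod_j c_j!\,(j!)^{c_j}$ and compares it with an explicit enumeration of all set partitions; the helper names are illustrative, and the enumeration is only practical for small sets.

```python
from collections import Counter
from math import factorial


def class_size(c):
    """Number of set partitions with partition vector c = {c_1, c_2, ...}."""
    n = sum(j * cj for j, cj in enumerate(c, start=1))
    denom = 1
    for j, cj in enumerate(c, start=1):
        denom *= factorial(cj) * factorial(j) ** cj
    return factorial(n) // denom


def set_partitions(items):
    """Yield every partition of the list `items` as a list of blocks."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for part in set_partitions(rest):
        # Put `first` into each existing block in turn ...
        for i in range(len(part)):
            yield part[:i] + [[first] + part[i]] + part[i + 1:]
        # ... or into a new block of its own.
        yield [[first]] + part


def count_by_shape(items):
    """Count set partitions of `items` grouped by ranked block sizes."""
    return Counter(
        tuple(sorted((len(block) for block in part), reverse=True))
        for part in set_partitions(list(items))
    )


if __name__ == "__main__":
    for shape, count in sorted(count_by_shape("abcd").items(), reverse=True):
        print(list(shape), count)
```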


2.3 Partition structure 

If all partitions from the class with partition vector $c$ are considered to be equivalent, then they should have the
same probability.

If $\pi_n(c)$ denotes the probability of an element of the partition class $c$, which contains $\mu(c)$ elements, then the
probability of that exchangeable class is

$$P_n(c) = \mu(c) \cdot \pi_n(c)$$

These probabilities should satisfy

$$\sum_{c \vdash n} P_n(c) = 1 \qquad (2)$$

Here $c \vdash n$ denotes that the summation runs over all classes of partitions of $n$ elements.

Partition structure. In general, it is not enough to assign probability measures over partition classes for
all values of $n$. Kingman [19] noticed that besides (2) there are consistency conditions linking the exchangeable
probability measures $P_{n-1}$ and $P_n$, and called such consistent sequences $\{P_n\}$ partition structures.
2.4 Ewens-Pitman Sampling Formulae

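A finite-dimensional counterpart of the one-parameter stick-breaking model (6) was proposed by Ewens in the
context of population biology. Given a partition vector $c$, the Ewens Sampling Formula assigns the probability

$$p(c) = \frac{n!}{\theta^{[n]}} \prod_{j=1}^{n} \left(\frac{\theta}{j}\right)^{c_j} \frac{1}{c_j!} \qquad (3)$$

where $\theta^{[n]} = \theta(\theta + 1) \cdots (\theta + n - 1)$.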


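Pitman studied the size-biased representation of the two-parameter model (7) and obtained the corresponding extension
of the Ewens sampling formula in [24]. The two-parameter Pitman Sampling Formula (PSF) gives the probability of the
partition class $c$

$$p(c) = \frac{n!}{\theta^{[n]}}\; \theta^{[k,\alpha]} \prod_{j=1}^{n} \left(\frac{(1 - \alpha)^{[j-1]}}{j!}\right)^{c_j} \frac{1}{c_j!} \qquad (4)$$

where $x^{[n]} = x(x + 1) \cdots (x + n - 1)$ and $\theta^{[k,\alpha]} = \theta(\theta + \alpha) \cdots (\theta + \alpha(k - 1))$, which shows that
for $\alpha = 0$ the formula reduces to (3).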

Kerov [17] proposed that formula (4) can be obtained via a model of random allocation with conditionally
independent variates.
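
A quick numerical check of the sampling formula is to sum it over all partition classes of a given n. The sketch below assumes the standard form of the two-parameter formula, $p(c) = \frac{n!}{\theta^{[n]}}\,\theta^{[k,\alpha]} \prod_j \big((1-\alpha)^{[j-1]}/j!\big)^{c_j} / c_j!$, which reduces to the Ewens formula at $\alpha = 0$; all function names are illustrative.

```python
from math import factorial


def rising(x, n, step=1.0):
    """x (x + step) ... (x + step (n - 1)); equals 1 when n = 0."""
    out = 1.0
    for i in range(n):
        out *= x + step * i
    return out


def pitman_sampling_formula(c, alpha, theta):
    """Probability of the partition class with partition vector c."""
    n = sum(j * cj for j, cj in enumerate(c, start=1))
    k = sum(c)
    p = factorial(n) / rising(theta, n) * rising(theta, k, alpha)
    for j, cj in enumerate(c, start=1):
        p *= (rising(1.0 - alpha, j - 1) / factorial(j)) ** cj / factorial(cj)
    return p


def integer_partitions(n, largest=None):
    """Yield the partitions of n as descending lists of parts."""
    if largest is None:
        largest = n
    if n == 0:
        yield []
        return
    for first in range(min(n, largest), 0, -1):
        for rest in integer_partitions(n - first, first):
            yield [first] + rest


def to_partition_vector(parts, n):
    """Descending parts -> vector c of length n with c[j-1] parts of size j."""
    c = [0] * n
    for part in parts:
        c[part - 1] += 1
    return c


def total_probability(n, alpha, theta):
    """Sum of the sampling formula over all classes of partitions of n."""
    return sum(
        pitman_sampling_formula(to_partition_vector(parts, n), alpha, theta)
        for parts in integer_partitions(n)
    )


if __name__ == "__main__":
    print(total_probability(6, 0.3, 5.0))  # should be 1 up to rounding
```

The same functions can be used to check consistency under uniform deletion of a box, for example that the probability of [2,2] equals 3/5 of the probability of [3,2] plus 1/5 of the probability of [2,2,1].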


2.5 Chinese Restaurant Process 

The Chinese Restaurant Process provides a probabilistic dynamics of partitions, ensuring that probabilities remain
exchangeable. Zabell [28] explains the metaphor as "on any given evening in Berkeley a large number of people
go to some Chinese restaurant in the downtown area. As each person arrives, he looks in the window of each
restaurant to decide whether or not to go inside. His chances of going in increases with the number of people
already seen inside... But there's some probability that he goes to an empty restaurant.."

More formally, it is assumed that there is an infinite number of tables (restaurants) and the first customer always
sits at the first unoccupied table, say table 1. Customer $n + 1$ observes $k$ occupied tables and

• joins the table with $n_i$ people with probability

$$p_i = \frac{n_i - \alpha}{n + \theta}$$

• joins a new, unoccupied table with probability

$$p_* = \frac{\theta + \alpha k}{n + \theta}$$

It is important that this process provides an exchangeable probability on partitions.
For instance partition n = [2,1,1] can migrate to following states 
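
The successor states and their probabilities can be enumerated directly from the seating rule. The following sketch does this for an arbitrary frequency vector and also simulates the seating of n customers; the function names and the parameter values in the usage lines are illustrative assumptions.

```python
import random
from collections import defaultdict


def crp_transitions(freq, alpha, theta):
    """Map each successor shape to its one-step seating probability."""
    n, k = sum(freq), len(freq)
    out = defaultdict(float)
    for i, n_i in enumerate(freq):
        grown = list(freq)
        grown[i] += 1  # the new customer joins table i
        out[tuple(sorted(grown, reverse=True))] += (n_i - alpha) / (n + theta)
    # The new customer opens a new table.
    new_shape = tuple(sorted(list(freq) + [1], reverse=True))
    out[new_shape] += (theta + alpha * k) / (n + theta)
    return dict(out)


def simulate_crp(n, alpha, theta, seed=0):
    """Seat n customers one by one; return table sizes in descending order."""
    rng = random.Random(seed)
    freq = [1]  # the first customer always opens table 1
    for size in range(1, n):
        u = rng.random() * (size + theta)
        acc = 0.0
        for i in range(len(freq)):
            acc += freq[i] - alpha
            if u < acc:
                freq[i] += 1
                break
        else:
            freq.append(1)  # remaining mass theta + alpha * k: new table
    return sorted(freq, reverse=True)


if __name__ == "__main__":
    for shape, prob in crp_transitions([2, 1, 1], alpha=0.3, theta=5.0).items():
        print(list(shape), round(prob, 4))
    print(simulate_crp(50, alpha=0.3, theta=5.0))
```

For n = [2,1,1] the successors are [3,1,1], [2,2,1] and [2,1,1,1]; the two singleton tables both lead to [2,2,1], so their probabilities are added.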
2.6 Infinite dimensional diffusions as random walks on partitions

Diffusion processes in the ordered infinite simplex were developed by Petrov, Olshanski and Borodin ([3], [23], also
[12]), by Feng et al. ([8], [7] and [6]) and by Ruggiero and Walker ([27]). Independently, the Markov chain induced by
down-up transitions was studied by Costantini, Garibaldi et al. ([15], [14]).

The idea of approximating a two-parameter diffusion process is relatively simple. Fix some large $n$ and,
starting with some partition, consider Down and Up jumps, where

• a down move deletes one cell of the Young diagram uniformly at random,

• an up move acts according to the Chinese Restaurant Process.

For instance, the figure below shows possible transitions between partitions of 4 and the corresponding DU-chain:



Figure 7: Example of Down-Up transitions: (a) Down-chain, (b) Up-chain, (c) DU-chain


The D- and U-operators preserve the probability structures given by the Ewens-Pitman sampling formulae, hence the
resulting Markov chain also preserves this distribution. The figure below illustrates an approximation of the diffusion
process in the Kingman simplex with parameters $\alpha = 0.3$, $\theta = 5$
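
A down-up chain of this kind can be sketched in a few lines. The code below is an illustration under stated assumptions, not the procedure used for the figure: the starting partition, the number of steps and the function names are arbitrary choices; the down move removes a uniformly chosen cell, and the up move follows the Chinese Restaurant seating rule with parameters $(\alpha, \theta)$.

```python
import random


def down_move(freq, rng):
    """Delete one cell chosen uniformly: block i is hit with probability n_i / n."""
    i = rng.choices(range(len(freq)), weights=freq)[0]
    out = list(freq)
    out[i] -= 1
    return [b for b in out if b > 0]


def up_move(freq, alpha, theta, rng):
    """Add one cell by the Chinese Restaurant rule with parameters (alpha, theta)."""
    k = len(freq)
    weights = [n_i - alpha for n_i in freq] + [theta + alpha * k]
    i = rng.choices(range(k + 1), weights=weights)[0]
    out = list(freq)
    if i == k:
        out.append(1)  # a new block is created
    else:
        out[i] += 1
    return out


def down_up_chain(freq, alpha, theta, steps, seed=0):
    """Return the list of ranked partitions visited by the down-up chain."""
    rng = random.Random(seed)
    state = sorted(freq, reverse=True)
    path = [state]
    for _ in range(steps):
        state = up_move(down_move(state, rng), alpha, theta, rng)
        state = sorted(state, reverse=True)
        path.append(state)
    return path


if __name__ == "__main__":
    path = down_up_chain([20] * 5, alpha=0.3, theta=5.0, steps=2000)
    n = sum(path[0])
    print([round(b / n, 3) for b in path[-1][:5]])  # top five weights x_i
```

Each step keeps the total n fixed, so dividing the ranked block sizes by n gives a path of ranked weights.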



Figure 8: Diffusion sample paths on partitions, top five $x_i(t)$
3 Dirichlet distribution and size-biased sampling


3.1 Dirichlet distribution 

The broken stick model is a particular case of the more general Dirichlet distribution, which is a probability measure
over vectors of proportions. Its density function is parametrized by the vector $\alpha = (\alpha_1, \ldots, \alpha_m)$

$$p(x) = \frac{\Gamma(\sum_i \alpha_i)}{\Gamma(\alpha_1) \cdots \Gamma(\alpha_m)}\; x_1^{\alpha_1 - 1} \cdots x_m^{\alpha_m - 1}$$

The distribution is defined for vectors of proportions from the simplex

$$\Delta_m = \{x \in \mathbb{R}^m \mid \sum_{i=1}^{m} x_i = 1, \ x_i \ge 0\}$$

It is convenient to denote that a vector $x \in \Delta_m$ has the $m$-dimensional Dirichlet distribution described by the
vector of parameters $\alpha$ as

$$x \sim \mathcal{D}_m(\alpha)$$

An alternative and equivalent definition of the Dirichlet distribution, which makes proofs of its properties almost
immediate, is given by normalization of a vector of gamma variables. For $m$ independent gamma distributed
variables $y_i \sim \mathcal{G}(\alpha_i)$, the vector of proportions defined by

$$x = \left(\frac{y_1}{\sum_i y_i}, \ \ldots, \ \frac{y_m}{\sum_i y_i}\right)$$

has the Dirichlet distribution $x \sim \mathcal{D}_m(\alpha)$.

The gamma distribution possesses the convolution property

$$\text{for independent } y_1 \sim \mathcal{G}(\alpha_1), \ y_2 \sim \mathcal{G}(\alpha_2): \quad y_1 + y_2 \sim \mathcal{G}(\alpha_1 + \alpha_2)$$

This property, together with the Lukacs characterization, which says that the gamma distribution is the unique
distribution for which $y_1/y_2$ is independent of $y_1 + y_2$, simplifies the proofs of the properties below.


Properties. Let $\theta = \sum_i \alpha_i$ denote the sum of the parameters. If $y_0 = \sum_{j=1}^{m} y_j$ denotes the sum of
independent gamma variables $y_i \sim \mathcal{G}(\alpha_i)$, then this sum has the gamma distribution $y_0 \sim \mathcal{G}(\theta)$ by the
convolution property.

If the component $y_i$ is separated and the others are lumped together, then the vector $(y_i, \sum_{j \ne i} y_j)$ has independent
components with gamma distributions with parameters $\alpha_i$ and $\theta - \alpha_i$ correspondingly. From normalization by
$y_0$ it follows that marginally $x_i = y_i/y_0$ have beta distributions

$$x_i \sim \mathcal{B}(\alpha_i, \theta - \alpha_i)$$
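
The gamma-normalization definition translates directly into a sampler, and the beta marginal can be checked through its first two moments, $\alpha_i/\theta$ and $\alpha_i(\theta - \alpha_i)/(\theta^2(\theta + 1))$. The sketch below uses only the standard library; the function names, the parameter vector and the number of trials are illustrative.

```python
import random


def sample_dirichlet(alphas, rng):
    """Normalize independent gamma variables with shapes alphas and unit scale."""
    ys = [rng.gammavariate(a, 1.0) for a in alphas]
    total = sum(ys)
    return [y / total for y in ys]


def marginal_moments(alphas, i, trials=20000, seed=2):
    """Monte Carlo mean and variance of the i-th Dirichlet component."""
    rng = random.Random(seed)
    xs = [sample_dirichlet(alphas, rng)[i] for _ in range(trials)]
    mean = sum(xs) / trials
    var = sum((x - mean) ** 2 for x in xs) / trials
    return mean, var


if __name__ == "__main__":
    alphas = [2.0, 3.0, 5.0]
    theta = sum(alphas)
    a = alphas[0]
    print(marginal_moments(alphas, 0))
    print(a / theta, a * (theta - a) / (theta ** 2 * (theta + 1)))
```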


Let $x_{[-i]}$ denote the vector $x$ with the $i$-th component removed

$$x_{[-i]} = (x_1, \ldots, x_{i-1}, x_{i+1}, \ldots, x_m)$$

Since in a vector $x \in \Delta_m$ the sum of components is 1, in the vector $x_{[-i]}$ the sum of components becomes
$1 - x_i$ and therefore the normalized vector $x_{[-i]}/(1 - x_i)$ belongs to the simplex $\Delta_{m-1}$.

The Dirichlet distribution possesses an important property of neutrality, that is, independence of $x_i$ and
$x_{[-i]}/(1 - x_i)$. Moreover, if the original vector $x \sim \mathcal{D}_m(\alpha)$, then

$$\frac{x_{[-i]}}{1 - x_i} \sim \mathcal{D}_{m-1}(\alpha_{[-i]})$$

This property is trivial for $m = 2$. For arbitrary dimension, consider a vector of $m$ independent gamma
variates $y_j$ and the corresponding Dirichlet distributed vector with components $x_j = y_j/y_0$. Removing the $i$-th
component from both vectors and normalizing the second one yields for $j \ne i$

$$\frac{x_j}{1 - x_i} = \frac{y_j/y_0}{1 - y_i/y_0} = \frac{y_j}{y_0} \cdot \frac{y_0}{y_0 - y_i} = \frac{y_j}{y_0 - y_i}$$

Independence (neutrality) follows from the Lukacs property.
Beta stick-breaking. These properties suggest a stick-breaking method of sampling from the Dirichlet
distribution. Imagine that a stick of unit length is broken by the following step-by-step procedure. By the
marginal property, the first component $x_1 = z_1$ can be modeled as

$$z_1 \sim \mathcal{B}(\alpha_1, \theta - \alpha_1)$$

By the neutrality property the first piece can be broken off. The remaining components have the
$\mathcal{D}_{m-1}(\alpha_{[-1]})$ distribution. The procedure is repeated for the remaining part of the stick, of length $1 - x_1$, with

$$z_2 \sim \mathcal{B}(\alpha_2, \theta - \alpha_1 - \alpha_2)$$

producing the new piece $x_2 = z_2(1 - z_1)$ and a residual of length $(1 - z_2)(1 - z_1) = 1 - x_1 - x_2$, etc.

In general, at stage $k$

$$x_k = z_k \prod_{j=1}^{k-1} (1 - z_j) = z_k \Big(1 - \sum_{j=1}^{k-1} x_j\Big)$$

For $k = m$ the last component is simply the remainder. In the case of the symmetric Dirichlet distribution with
$\alpha_i = \alpha$ the breaking rule simplifies to

$$z_k \sim \mathcal{B}(\alpha, \theta - k\alpha)$$

$$x_k = z_k (1 - x_1 - \ldots - x_{k-1})$$
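
The stick-breaking procedure can be sketched as follows, using beta variates from the standard library. The function names and test parameters are illustrative; the last component is taken as the remainder, as in the text, because the second beta parameter would otherwise be zero.

```python
import random


def stick_breaking_dirichlet(alphas, rng):
    """Sample a Dirichlet vector piece by piece with beta-distributed breaks."""
    theta = sum(alphas)
    xs, rest, used = [], 1.0, 0.0
    for a in alphas[:-1]:
        used += a
        z = rng.betavariate(a, theta - used)  # z_k ~ B(alpha_k, theta - alpha_1 - ... - alpha_k)
        xs.append(z * rest)                   # x_k = z_k * (1 - x_1 - ... - x_{k-1})
        rest *= 1.0 - z
    xs.append(rest)                           # the last piece is the remainder
    return xs


def component_means(alphas, trials=20000, seed=4):
    """Monte Carlo means of the components, to compare with alpha_i / theta."""
    rng = random.Random(seed)
    totals = [0.0] * len(alphas)
    for _ in range(trials):
        for i, x in enumerate(stick_breaking_dirichlet(alphas, rng)):
            totals[i] += x
    return [t / trials for t in totals]


if __name__ == "__main__":
    print(component_means([2.0, 3.0, 5.0]))  # compare with [0.2, 0.3, 0.5]
```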


3.2 Size-biased sampling 

The stick-breaking method from the previous paragraph samples proportions from the Dirichlet distribution
component by component. In many applications, as well as in the development of the Poisson-Dirichlet distribution,
it is more important to study proportions ranked in descending order

$$x_{(1)} \ge x_{(2)} \ge \ldots$$

so it would be more convenient to have a procedure which outputs the order statistics of the distribution.
In general this is not easy; however, it is possible to devise a simulation strategy which provides samples in
proportion to their appearance in the real distribution, which is given by size-biased sampling.

Suppose that, given a Dirichlet distributed vector $x \sim \mathcal{D}_m(\alpha)$, one of its components is chosen at random,
such that $x_j$ is the probability of choosing the $j$-th component. Alternatively, it can be visualized as dropping a point
uniformly on a stick of length one, which is divided into proportions $x_1, x_2, \ldots$, and choosing the proportion/piece
on which the point falls. The value of this proportion is called a size-biased sample, and the chances of choosing the
largest piece are the highest, etc. Once a proportion is chosen, it is set apart and the procedure is repeated with the
normalized residual. The outcome is the size-biased permutation of the vector $x$, in which components are randomly
interchanged, with a bias towards the ordered case.

The density function of the first size-biased pick in the permuted vector can be found by the following argument.
A proportion $x$ may be picked with probability $x$ as the first component of a vector $(x, x_2, \dots, x_m)$, or as
the second component of a vector $(x_1, x, x_3, \dots, x_m)$, etc. In each of these cases, finding the unconditional
probability amounts to marginalization over the Dirichlet density, which yields the beta densities
$B_{\alpha_i, \theta - \alpha_i}(x)$, and therefore the total probability is

$$p(x) = \sum_{i=1}^{m} x \cdot B_{\alpha_i, \theta - \alpha_i}(x)$$

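In the case of the symmetric Dirichlet distribution the density of the size-biased pick simplifies to

$$p(x) = \frac{\alpha m \, \Gamma(\alpha m)}{\alpha \Gamma(\alpha) \Gamma(\theta - \alpha)} \, x^{\alpha} (1 - x)^{\theta - \alpha - 1} = \frac{\Gamma(\theta + 1)}{\Gamma(\alpha + 1) \Gamma(\theta - \alpha)} \, x^{\alpha} (1 - x)^{\theta - \alpha - 1}.$$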





In other words, the first size-biased pick (SBP) $x_1 = \tilde{z}_1$ has a beta distribution with shifted parameter

$$x_1 \sim B(1 + \alpha, \theta - \alpha)$$
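
As a numerical sanity check of this statement, the mean of the first size-biased pick can be compared with the mean $(1 + \alpha)/(1 + \theta)$ of the $B(1 + \alpha, \theta - \alpha)$ law. The sketch below is a Monte Carlo illustration only: the helper names, the sample size and the values $m = 5$, $\alpha = 0.5$ are assumptions, and the Dirichlet vector is generated by normalizing independent gamma variables.

```python
import random

def symmetric_dirichlet(m, alpha, rng):
    """Symmetric Dirichlet vector obtained by normalizing gamma variables."""
    g = [rng.gammavariate(alpha, 1.0) for _ in range(m)]
    s = sum(g)
    return [v / s for v in g]

def first_size_biased_pick(x, rng):
    """Pick component j with probability x_j and return its value."""
    u = rng.random()
    acc = 0.0
    for piece in x:
        acc += piece
        if u < acc:
            return piece
    return x[-1]                      # guards against rounding at the right end

def mean_first_pick(m, alpha, n_samples, rng):
    total = 0.0
    for _ in range(n_samples):
        total += first_size_biased_pick(symmetric_dirichlet(m, alpha, rng), rng)
    return total / n_samples

m, alpha = 5, 0.5
theta = alpha * m
rng = random.Random(1)
estimate = mean_first_pick(m, alpha, 20000, rng)
beta_mean = (1 + alpha) / (1 + theta)   # mean of B(1 + alpha, theta - alpha)
print(estimate, beta_mean)
```

Agreement of the two printed numbers up to Monte Carlo error is consistent with the shifted beta law, although it checks only the first moment.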

After breaking off the first SBP and applying the procedure repeatedly, it can be shown that the fraction broken off
from the remaining part of the stick at stage $k$ has the beta distribution

$$\tilde{z}_k \sim B(1 + \alpha, \theta - \alpha k) \qquad (5)$$

with corresponding proportions

$$x_k = \tilde{z}_k (1 - x_1 - \dots - x_{k-1})$$

Since $\theta = \alpha m$, the simulation terminates at stage $k = m - 1$, with $x_m$ being the length of the remainder.

Samples obtained this way tend to have larger proportions appearing first, followed by smaller proportions. After
ranking, both methods (standard and size-biased) produce identically distributed sequences, since the SBP only
randomly permutes the components of a symmetric Dirichlet vector.
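
The two claims above (larger pieces tend to come first, while the ranked values have the same law) can be illustrated by simulation. The sketch below is an assumption-laden check rather than a proof: function names, seed and sample size are hypothetical, and only the means of the first and of the largest piece are compared.

```python
import random

def _break_stick(m, alpha, shift, rng):
    """Stick-breaking with z_k ~ B(shift + alpha, theta - alpha * k), theta = alpha * m."""
    theta = alpha * m
    remainder, x = 1.0, []
    for k in range(1, m):
        z_k = rng.betavariate(shift + alpha, theta - alpha * k)
        x.append(z_k * remainder)
        remainder *= 1.0 - z_k
    x.append(remainder)               # x_m is the length of the remainder
    return x

def standard_sample(m, alpha, rng):
    """Standard stick-breaking for the symmetric Dirichlet law."""
    return _break_stick(m, alpha, 0.0, rng)

def size_biased_sample(m, alpha, rng):
    """Size-biased stick-breaking, rule (5)."""
    return _break_stick(m, alpha, 1.0, rng)

def mean_of(stat, sampler, m, alpha, n, rng):
    """Monte Carlo mean of stat(x) over n samples."""
    return sum(stat(sampler(m, alpha, rng)) for _ in range(n)) / n

def first(x):
    return x[0]

rng = random.Random(2)
m, alpha, n = 5, 0.5, 10000
print("mean first piece  :", mean_of(first, standard_sample, m, alpha, n, rng),
      mean_of(first, size_biased_sample, m, alpha, n, rng))
print("mean largest piece:", mean_of(max, standard_sample, m, alpha, n, rng),
      mean_of(max, size_biased_sample, m, alpha, n, rng))
```

The first line should show clearly different means (about $1/m$ versus $(1 + \alpha)/(1 + \theta)$), whereas the means of the largest piece should agree up to sampling noise.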

At first the purpose of the SBP is not clear, since the components of an ordered Dirichlet distributed vector can be
sampled by standard procedures and then ranked. However, as will be shown below, size-biased simulation allows
sampling in cases which are not directly accessible.

4 The Poisson-Dirichlet distribution 

4.1 One-parameter family 

For the symmetric $m$-dimensional Dirichlet distribution $\mathcal{D}_m(\alpha)$ consider the limiting case where the
dimensionality $m \to \infty$ such that $\theta = \alpha m$, which means that while the dimension goes to infinity the
total charge $\theta$ remains the same and the individual parameters $\alpha \to 0$. In this case direct application
of the standard stick-breaking method is impossible, since we would have to sample from
$B(\varepsilon, \theta - \varepsilon)$, where $\varepsilon$ is infinitesimally small. However, for any $m$ it is
possible to consider size-biased stick-breaking with

$$\tilde{z}_k \sim B(1 + \theta/m, \theta - k \cdot \theta/m)$$


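which gives for $m \to \infty$

$$\tilde{z}_k \sim B(1, \theta)$$

$$x_k = \tilde{z}_k (1 - x_1 - \dots - x_{k-1})$$

Ranked values of the size-biased sequence $\{x_k\}$ have the one-parameter Poisson-Dirichlet distribution
$\mathcal{PD}(\theta)$

$$x_{(1)} \ge x_{(2)} \ge \dots \qquad (6)$$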

It is important to note that, in contrast to (5), the sequences $\tilde{z}_k$ and $x_k$ are infinite, since
ultimately they correspond to the infinite-dimensional Dirichlet distribution.
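
Because the sequence is infinite, any simulation has to truncate it. A minimal sketch, assuming truncation once the unbroken remainder falls below a tolerance (the function name, `tol` and `max_terms` are illustrative choices, not part of the model):

```python
import random

def sample_pd_one_parameter(theta, rng, tol=1e-12, max_terms=100000):
    """Approximate sample from PD(theta): break with z_k ~ B(1, theta), then rank."""
    remainder = 1.0
    x = []
    while remainder > tol and len(x) < max_terms:
        z_k = rng.betavariate(1.0, theta)
        x.append(z_k * remainder)     # x_k = z_k (1 - x_1 - ... - x_{k-1})
        remainder *= 1.0 - z_k
    # ranked values, plus the mass left unassigned by the truncation
    return sorted(x, reverse=True), remainder

ranked, remainder = sample_pd_one_parameter(theta=5.0, rng=random.Random(3))
print(ranked[:5], len(ranked), remainder)
```

The remainder shrinks geometrically on average, so for moderate $\theta$ a few hundred terms are typically enough to exhaust the stick to this tolerance.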

An alternative way of sampling is to consider the jumps of a gamma subordinator in the time interval $[0, \theta]$
and to normalize the ranked jumps in this interval.


4.2 Two-parameter family 

Engen [5] noticed that the size-biased stick-breaking model (5) remains valid for negative values of the parameter in
the range $\alpha \in (-1, 0]$, which after the relabeling $(-\alpha) \mapsto \alpha$ leads to the following sampling
method

$$\tilde{z}_k \sim B(1 - \alpha, \theta + \alpha k) \qquad (7)$$

where, as before, the corresponding partitions of the unit interval are given by

$$x_k = \tilde{z}_k (1 - x_1 - \dots - x_{k-1})$$

The two-parameter Poisson-Dirichlet distribution is defined as the distribution of the ranked values of the sequence
$\{x_k\}$

$$x_{(1)} \ge x_{(2)} \ge \dots$$

In the setting of (7) the range of the parameter $\alpha$ is $-\theta < \alpha < 1$. For values $\alpha < 0$ such that
$\theta = m|\alpha|$ the sequence $x_k$ eventually stops and corresponds to the Dirichlet distribution. When
$0 \le \alpha < 1$, as in the one-parameter case, the sequence $x_k$ does not stop, and thus for this range of the
parameter $\alpha$ the model is infinite-dimensional.
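
Rule (7) translates directly into a sampler. The sketch below is a hedged illustration: the function name, the fixed truncation length `n_terms` and the numerical threshold for detecting the finite (Dirichlet) case are assumptions made for the example.

```python
import random

def sample_pd_two_parameter(alpha, theta, n_terms, rng):
    """Approximate ranked sample from PD(alpha, theta) via rule (7)."""
    if not (alpha < 1.0 and theta > -alpha):
        raise ValueError("need -theta < alpha < 1")
    remainder = 1.0
    x = []
    for k in range(1, n_terms + 1):
        b = theta + alpha * k
        if b <= 1e-12:
            # finite case (alpha < 0, theta = m * |alpha|): the stick is exhausted
            x.append(remainder)
            remainder = 0.0
            break
        z_k = rng.betavariate(1.0 - alpha, b)
        x.append(z_k * remainder)
        remainder *= 1.0 - z_k
    # ranked values, plus the mass left unassigned by the truncation
    return sorted(x, reverse=True), remainder

ranked, remainder = sample_pd_two_parameter(0.44, 18.0, 500, random.Random(4))
print(ranked[:5], remainder)
```

For $0 < \alpha < 1$ the unassigned mass decays slowly in the number of terms, so the returned remainder should be inspected before the truncated sample is used.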
5 Market capitalizations and $\mathcal{PD}(\alpha, \theta)$

Plots in this section illustrate the fit of capital distribution curves for several international stock exchanges
by averages of the Poisson-Dirichlet law. Blue circles correspond to ranked relative capitalizations, and red lines
represent averages of several samples from the two-parameter Poisson-Dirichlet distribution. The parameters are
fitted by the least-squares method.
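
The text does not spell out the fitting procedure, so the following is only one possible reading of it. It assumes a grid search over $(\alpha, \theta)$, squared errors measured between logarithms of weights, renormalization of the top-ranked averaged weights, and a truncated stick-breaking sampler; all function names and the synthetic target curve are hypothetical.

```python
import math
import random

def ranked_pd_sample(alpha, theta, n_terms, rng):
    """Truncated stick-breaking sample from PD(alpha, theta), ranked in descending order."""
    remainder, x = 1.0, []
    for k in range(1, n_terms + 1):
        z_k = rng.betavariate(1.0 - alpha, theta + alpha * k)
        x.append(z_k * remainder)
        remainder *= 1.0 - z_k
    return sorted(x, reverse=True)

def averaged_curve(alpha, theta, n_ranks, n_samples, rng, n_terms=400):
    """Average the top n_ranks ranked weights over several samples, then renormalize."""
    avg = [0.0] * n_ranks
    for _ in range(n_samples):
        ranked = ranked_pd_sample(alpha, theta, n_terms, rng)
        for i in range(n_ranks):
            avg[i] += ranked[i] / n_samples
    total = sum(avg)
    return [v / total for v in avg]

def fit_by_least_squares(weights, alphas, thetas, n_samples=20, seed=0):
    """Grid search minimizing the sum of squared log-weight differences (an assumption)."""
    target = [math.log(w) for w in weights]
    best = None
    for alpha in alphas:
        for theta in thetas:
            rng = random.Random(seed)     # same random stream for every grid point
            curve = averaged_curve(alpha, theta, len(weights), n_samples, rng)
            sse = sum((math.log(c) - t) ** 2 for c, t in zip(curve, target))
            if best is None or sse < best[0]:
                best = (sse, alpha, theta)
    return best

# Synthetic target generated by the same sampler; this only checks the plumbing.
synthetic = averaged_curve(0.5, 20.0, 50, 20, random.Random(0))
sse, alpha_hat, theta_hat = fit_by_least_squares(
    synthetic, alphas=[0.3, 0.5, 0.7], thetas=[10.0, 20.0, 40.0])
print(alpha_hat, theta_hat, sse)
```

With real data, `weights` would be the ranked capitalizations normalized to sum to one, and the grid would need to be finer; a continuous optimizer would be complicated by the Monte Carlo noise in the averaged curve.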

The first example shows that the Poisson-Dirichlet distribution provides a fit for ranked normalized capitalizations
of world stock exchanges. The data source is http://www.world-exchanges.org/statistics/monthly-reports; data are as
of the end of November 2014.


Figure 9: NYSE, NASDAQ, Japan, EURONEXT, Hong Kong, ... ($\alpha = 0.44, \theta = 18$) [panel title: World markets]

Figures below illustrate the results of modeling the capital distribution curve in major stock exchanges. The data
source is http://www.google.com/finance#stockscreener; data are as of December 9, 2014. A possible explanation of
these results is provided by the hypothesis that the two-parameter model approximates the underlying partition
structure in the corresponding markets. This suggests that the stochastic evolution of ranked market weights can be
modeled by the two-parameter diffusion process or its modification.



[Figures 10-31: one panel per exchange, weight plotted against rank.]

Figure 10: Australia ($\alpha = 0.45, \theta = 18$)

Figure 11: Belgium ($\alpha = 0.73, \theta = 19$)

Figure 12: Brazil ($\alpha = 0.1, \theta = 50$)

Figure 13: Canada (w/o CNOOC) ($\alpha = 0.75, \theta = 40$) [panel title: Canada (Toronto)]

Figure 14: China (Shanghai) ($\alpha = 0.57, \theta = 20$)

Figure 15: France ($\alpha = 0.35, \theta = 20$)

Figure 16: Germany ($\alpha = 0.2, \theta = 34$) [panel title: Germany (Frankfurt)]

Figure 17: Hong Kong ($\alpha = 0.47, \theta = 40$)

Figure 18: India ($\alpha = 0.43, \theta = 37$) [panel title: India (Bombay)]

Figure 19: Israel ($\alpha = 0.65, \theta = 45$)

Figure 20: Korea ($\alpha = 0.55, \theta = 45$)

Figure 21: Italy ($\alpha = 0.3, \theta = 15$)

Figure 22: Japan ($\alpha = 0.48, \theta = 95$)

Figure 23: Russia ($\alpha = 0.2, \theta = 15$) [panel title: Russia (MICEX)]

Figure 24: Singapore ($\alpha = 0.51, \theta = 20$)

Figure 25: Spain ($\alpha = 0.02, \theta = 16$)

Figure 26: Switzerland ($\alpha = 0.4, \theta = 9$)

Figure 27: Taiwan ($\alpha = 0.5, \theta = 50$)

Figure 28: Turkey ($\alpha = 0.24, \theta = 26$)

Figure 29: United Kingdom ($\alpha = 0.65, \theta = 15$) [panel title: UK (London)]

Figure 30: United States ($\alpha = 0.60, \theta = 55$) [panel title: NASDAQ]

Figure 31: United States ($\alpha = 0.28, \theta = 255$) [panel title: NYSE]
5.1 S&P 500 drawdowns


The figure in this section illustrates another application of the two-parameter Poisson-Dirichlet distribution: a
fit to normalized relative daily drawdowns of the S&P 500.



Figure 32: S&P 500 relative daily drawdowns, from 1950 to 2014; $\theta = 80$, $\alpha = 0.24$


6 Summary 

The central idea of the proposed approach is probabilistic-combinatorial and can be summarized as follows:

• combinatorics: The stock market is considered as a large combinatorial structure, a partition of the set of all
invested units of money. Stock capitalizations, represented by integers, define partitions of the market
value. The number of ways this state can be realized combinatorially is given by formula (1).

• probability: A partition structure defines, for each level $n \ge 1$, an exchangeable probability on all partitions
with $n$ elements, such that the distribution on level $n$ is consistent with the distribution on partitions with
$n + 1$ elements. These consistency conditions determine the up ($n \to n + 1$) and down ($n \to n - 1$) conditional
transition probabilities.

• The two-parameter Poisson-Dirichlet distribution, defined on the infinite simplex of ranked weights,
has a corresponding partition structure, given by formula (4). The stick-breaking construction provides a
size-biased method of sampling from the two-parameter model. The associated diffusion process, induced by the
down/up Markov chains, has the $\mathcal{PD}(\alpha, \theta)$ law as its unique reversible, and therefore equilibrium, distribution.
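
The up and down transitions mentioned in the list can be sketched as follows. The sketch assumes the standard two-parameter rule, sometimes called the two-parameter Chinese restaurant process: an existing block of size $n_i$ grows with probability $(n_i - \alpha)/(n + \theta)$, and a new block opens with probability $(\theta + k\alpha)/(n + \theta)$, where $k$ is the current number of blocks. The down step is taken to be removal of a uniformly chosen unit. Function names and parameter values are illustrative, and the code assumes $0 \le \alpha < 1$ and a non-empty list of blocks.

```python
import random

def up_step(blocks, alpha, theta, rng):
    """One n -> n + 1 transition: add a unit to an existing block or open a new one."""
    n = sum(blocks)
    u = rng.random() * (n + theta)
    acc = 0.0
    for i, size in enumerate(blocks):
        acc += size - alpha           # block i grows w.p. (n_i - alpha) / (n + theta)
        if u < acc:
            return blocks[:i] + [size + 1] + blocks[i + 1:]
    return blocks + [1]               # new block w.p. (theta + k * alpha) / (n + theta)

def down_step(blocks, rng):
    """One n -> n - 1 transition: remove a uniformly chosen unit."""
    u = rng.random() * sum(blocks)
    acc = 0.0
    for i, size in enumerate(blocks):
        acc += size                   # block i loses a unit w.p. n_i / n
        if u < acc:
            break
    shrunk = blocks[:i] + [size - 1] + blocks[i + 1:]
    return [b for b in shrunk if b > 0]   # drop a block that became empty

rng = random.Random(6)
blocks = [1]
for _ in range(999):
    blocks = up_step(blocks, alpha=0.5, theta=20.0, rng=rng)
weights = sorted((b / sum(blocks) for b in blocks), reverse=True)
print(len(blocks), weights[:5])
```

Alternating `down_step` and `up_step` at a fixed level $n$ gives a simple random walk on partitions of $n$ units, in the spirit of the combinatorial random walks mentioned in this summary.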

The results of Section 5 suggest the hypothesis that the two-parameter model approximates the stationary distribution
of the capital distribution curve, as well as the corresponding underlying partition structure. It is proposed that
the vector of ranked market weights (the capital distribution curve) fluctuates in stochastic equilibrium, which can
be modelled by means of the two-parameter diffusion process or by combinatorial random walks on partitions.